Goto

Collaborating Authors

 end pose


Robot Crash Course: Learning Soft and Stylized Falling

Strauch, Pascal, Müller, David, Christen, Sammy, Serifi, Agon, Grandia, Ruben, Knoop, Espen, Bächer, Moritz

arXiv.org Artificial Intelligence

Despite recent advances in robust locomotion, bipedal robots operating in the real world remain at risk of falling. While most research focuses on preventing such events, we instead concentrate on the phenomenon of falling itself. Specifically, we aim to reduce physical damage to the robot while providing users with control over a robot's end pose. To this end, we propose a robot agnostic reward function that balances the achievement of a desired end pose with impact minimization and the protection of critical robot parts during reinforcement learning. To make the policy robust to a broad range of initial falling conditions and to enable the specification of an arbitrary and unseen end pose at inference time, we introduce a simulation-based sampling strategy of initial and end poses. Through simulated and real-world experiments, our work demonstrates that even bipedal robots can perform controlled, soft falls.


Learning to Pick: A Visuomotor Policy for Clustered Strawberry Picking

Fei, Zhenghao, Lu, Wenwu, Hou, Linsheng, Peng, Chen

arXiv.org Artificial Intelligence

Abstract--Strawberries naturally grow in clusters, interwoven with leaves, stems, and other fruits, which frequently leads to occlusion. This inherent growth habit presents a significant challenge for robotic picking, as traditional percept-plan-control systems struggle to reach fruits amid the clutter . Effectively picking an occluded strawberry demands dexterous manipulation to carefully bypass or gently move the surrounding soft objects and precisely access the ideal picking point--located at the stem just above the calyx. T o address this challenge, we introduce a strawberry-picking robotic system that learns from human demonstrations. Our system features a 4-DoF SCARA arm paired with a human teleoperation interface for efficient data collection and leverages an End Pose Assisted Action Chunking Transformer (ACT) to develop a fine-grained visuomotor picking policy. Experiments under various occlusion scenarios demonstrate that our modified approach significantly outperforms the direct implementation of ACT, underscoring its potential for practical application in occluded strawberry picking. LOBAL demand for strawberries, a high-value crop, continues to rise. While China led production in 2023 with 3,336,690 tons, followed by the US with 1,055,963 tons [1], harvesting remains labor-intensive due to the fruit's fragility. This contrasts with mechanized harvesting of crops like corn and wheat.


HumanMAC: Masked Motion Completion for Human Motion Prediction

Chen, Ling-Hao, Zhang, Jiawei, Li, Yewen, Pang, Yiren, Xia, Xiaobo, Liu, Tongliang

arXiv.org Artificial Intelligence

Human motion prediction is a classical problem in computer vision and computer graphics, which has a wide range of practical applications. Previous effects achieve great empirical performance based on an encoding-decoding style. The methods of this style work by first encoding previous motions to latent representations and then decoding the latent representations into predicted motions. However, in practice, they are still unsatisfactory due to several issues, including complicated loss constraints, cumbersome training processes, and scarce switch of different categories of motions in prediction. In this paper, to address the above issues, we jump out of the foregoing style and propose a novel framework from a new perspective. Specifically, our framework works in a masked completion fashion. In the training stage, we learn a motion diffusion model that generates motions from random noise. In the inference stage, with a denoising procedure, we make motion prediction conditioning on observed motions to output more continuous and controllable predictions. The proposed framework enjoys promising algorithmic properties, which only needs one loss in optimization and is trained in an end-to-end manner. Additionally, it accomplishes the switch of different categories of motions effectively, which is significant in realistic tasks, e.g., the animation task. Comprehensive experiments on benchmarks confirm the superiority of the proposed framework. The project page is available at https://lhchen.top/Human-MAC.